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CHROMATIC THRESHOLDS IN SPARSE RANDOM GRAPHS 


PETER ALLEN, JULIA BOTTCHER, SIMON GRIFFITHS 
YOSHIHARU KOHAYAKAWA AND ROBERT MORRIS 

Abstract. The chromatic threshold S^{H,p) of a graph H with respect to the random 
graph G{n,p) is the infimum over d > 0 such that the following holds with high probability: 
the family of iL-free graphs G C G(n,p) with minimum degree S{G) ^ dpn has bounded 
chromatic number. The study of S^{H) := was initiated in 1973 by Erdos and 

Simonovits. Recently 6^{H) was determined for all graphs H. It is known that S^{H,p) = 
SxiH) for all fixed p S (0,1), but that typically 5^{H,p) ^ ii p = o(l). 

Here we study the problem for sparse random graphs. We determine S^{H,p) for most 
functions p = p{n) when H G {K^, Cs}, and also for all graphs H with x(i7) ^ {3,4}. 


1. Introduction 

An important recent trend in combinatorics has been the formulation and proof of so-called 
‘sparse random analogues’ of many classical extremal and structural results. For example, 
Kohayakawa, Luczak and Rodl [11] conjectured almost twenty years ago that Szemeredi’s 
theorem on fc-term arithmetic progressions should hold in a p-random subset of {1,... ,n} 
if p 3> and that the Erdos-Stone theorem should hold in the Erdos-Renyi random 

graph G{n,p) if p 3> 77 ,-i/™' 2 (R')^ where m 2 {H) is dehned to be the maximum of over 

all subgraphs F G H with v{F) ^ 3. The study of these conjectures recently culminated in 
the extraordinary breakthroughs of Conlon and Gowers [5] and Schacht [16], who resolved 
these conjectures (and many others), and in the development of the so-called ‘hypergraph 
container method’, see [3, 15]. 

In this paper we study a sparse random analogue of the chromatic threshold S^{H) of a 
graph if, which is dehned to be the inhmum over d > 0 such that every if-free graph on 
n vertices with minimum degree at least dn has bounded chromatic number. The study of 
chromatic thresholds was initiated in 1973 by Erdos and Simonovits [7], who were motivated 
by Erdos’ famous probabilistic proof [6] that there exist graphs with arbitrarily high girth 
and chromatic number. Building on work of (amongst others) Luczak and Thomasse [13] 
and Lyle [14], the chromatic threshold of every graph H was hnally determined in [2], where 
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it was proved that 
( 1 ) 


Km e 


|r — 3 2r — 5 r — 2| 

\r -2’ 2r -3’ r - 1 J 

for every graph H with chromatic number x{H) = r ^ 2. For example, S^{K^) = 1/3 and 
5^{C2k+i) = 0 for every /c ^ 2, as was first proved by Thomassen [17, 18]. 

We will study the following analogue of in G{n,p). 

Definition 1.1. Given a graph H and a function p = p{n) G [0,1], dehne 

Sy-[H,p) := inf jd > 0 : there exists G > 0 such that the following holds 

with high probability: every id-free spanning subgraph 

G C G{n,p) with 5{G) ^ dpn satishes x(G) ^ g|. 

We call (id, p) the chromatic threshold of H with respect to p. 

Note that 1), so this definition generalises that of Erdos and Simonovits. 

We emphasise that the constant G is allowed to depend on the graph id, the function p 
and the number d, but not on the integer n. We also note that if, for some d, with high 
probability there is no spanning id-free subgraph of G{n,p) whose average degree exceeds 
dpn, then vacuously we have 5^{H,p) ^ d. 

In a companion paper [1] we showed that 5^{H,p) = for all fixed p > 0, and made 

some progress on determining 5^{H,p) in the case p = Here we begin the investigation 

of S^{H,p) for sparser random graphs. In particular, we will prove the following theorem, 
which determines S^{H,p) precisely for a large class of graphs and essentially all functions 
p = pin). We will write 7r(di) for the Turan density 1 — of a graph id. 

Theorem 1.2. Let H be a graph with x(dd) ^ {3,4}. Then 

{ 5xiH) if p > 0 is constant, 

7r(di) if p 1, 

1 if ^ 

The same moreover holds for all graphs H with x(id) = 4 and m 2 (id) ^ 2. 

We remark that (2) does not hold for all 3-chromatic graphs; for example, it does not 
hold when id = G 2 k+i for any k ^ 2, see Theorems 1.4 and 1.5 below. We suspect that 
it does not hold for every 4-chromatic graph, though we do not have a counter-example, 
see Proposition 3.6 and Conjecture 6.1. Note that if p then d^(di,p) = 0, since 

with high probability G{n,p) has an isolated vertex, so the condition in the dehnition holds 
vacuously. We also remark that the proof of Theorem 1.2 uses a general probabilistic lemma 
(Lemma 2.2), which appears to be new, and may be of independent interest. 

For graphs id with x(di) = 3, it was shown in [1] that the situation is signihcantly more 
complicated, even in the case p = . We will therefore restrict ourselves to studying one 

specihc family of particular interest, namely the odd cycles. (For a brief discussion of more 
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general 3-chromatic graphs, see Section 6 .) For we will show that the pattern is the same 
as in Theorem 1.2. 


Theorem 1.3. We have 


[ 1 

For 6 * 5 , on the other hand, the behaviour is 


p > 0 is constant 

7 ^- i /2 p I 
n ^ 

more complex. 


Theorem 1.4. IFe have 


<^x(C'5,P) 


"0 if p > 0 is constant 
I if < p < 1 

\ if p 

. 1 */ ^<P<n-3/h 


We believe that this is the hrst example of a random analogue of a natural extremal result 
which has been shown to exhibit several non-trivial phase transitions. Finally, for longer 
odd cycles we will obtain only the following partial description. 


Theorem 1.5. For every k ^ 3, we have 

{ 0 z/ p 3> 

I if n-(2*:-l)/2A: p ^-(2k-2,)/{2k-2) 

1 */ ^ < P < n-(2'=-b/2fc. 

We suspect that ( 5 ^(C 2 fc+i,p) = 0 for all p 3> see Section 6 , and that the 

techniques introduced in this paper could potentially be extended to prove this conjecture. 
However, we do not know what value to expect for 5^{C2k+ii'p) fhe remaining range. 

Finally, we remark that Theorem 1.5 proves a particular case of a much more general con¬ 
jecture (see [1, Conjecture 1.6]) which states that, in the range p = the 3-chromatic 

graphs H for which S^{H,p) = 0 are exactly the ‘thundercloud-forest graphs’, see Dehni- 
tion 4.1. Together with the results of this paper and those in [1], this conjecture (if true) 
would completely characterise S^{H,p) in this range. 

Organisation of the paper. We begin, in Section 2, by establishing that G{n,p) contains 
a small subgraph with high girth, chromatic number and expansion (see Proposition 2.1). 
This result will be used in the lower bound constructions for Theorems 1.2, 1.3, and 1.4. In 
Section 3 we prove Theorem 1.2 and construct an inhnite family of 4-chromatic graphs H 
with 7712 (i?) < 2 (see Proposition 3.6). In Section 4 we provide the lower bound constructions 
for Theorems 1.3 and 1.4. We remark that some of these constructions are given in a more 
general framework and reused in [1]. In Section 5 we prove the remaining upper bounds that 
imply Theorems 1.3, 1.4, and 1.5. We conclude with some open problems in Section 6 . 

We remark that we often omit floors and ceilings whenever this does not affect the argu¬ 
ment. 
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2. Small subgraphs of G{n,p) with large girth and good expansion 

In this section we will prove the following proposition, which will be a key tool in the proof 
of Theorem 1.2, and in the lower bound constructions in Theorems 1.3 and 1.4. 

Proposition 2.1. For every k E N and e > 0, and every function p = o(l), the 

following holds with high probability: there exists a subgraph G C G{n,p) such that 

v{G) ^ e/p, x{G) ^ k and girth(G) ^ k, 

and moreover, for every pair A,B V{G) of disjoint vertex sets of size at least e\G\, the 
graph G[A, B] contains an edge. 

Observe that this result would be easy if t(G) ^ e/p were replaced with v{G) ^ K/p for 
some large constant K, because G{K/p,p) with high probability has the desired properties. 
So the difficulty is to obtain a much smaller subgraph with these properties. Similarly, 
a random graph on m := e/p vertices with kfm edges has a subgraph with the desired 
properties (see Lemma 2.3). This is denser than our G{n,p) but we will show that with high 
probability some set of m vertices contains exactly k'^m edges (see Lemma 2.4). 

However, this is not enough to conclude that G{n,p) contains a subgraph with the desired 
properties. Indeed, writing Ea for the event that A contains exactly k‘^\A\ edges, and F for 
the event that the desired subgraph G exists, we just argued that 

P| —o(l) and P(F I Ea) = 1 — o(l) for every |H| = m. 

^ \A\=m ' 

Now suppose that each Ea were the disjoint union of F and E'a, where the E'a are disjoint 
events, with P(Ea) P(E) for each i, but P(E) Z]|A|=m other words, each 

event F\Ea is very likely for the same reason. If this were the case then F would be a very 
unlikely event. 

In this scenario, however, the random variable X counting the number of Ea which occur 
is not concentrated near its expectation, that is, Var(X) is not small compared to E[X]^. 
We will show in Lemma 2.4 that in reality that is not the case. This combined with the 
following lemma suffices to prove Proposition 2.1. We formulate this lemma for a general 
setup and believe it is likely to have other applications. 

Lemma 2.2. Let Ei,Et andF be events with non-zero probability, andletX = 

Suppose that for some 6 ^ 0 we have Var(X) ^ 5E[X]^ and P(E | Ej) ^ 1 —d for each j E [t]. 
Then P(E) ^1 — 66. 

Proof. By Chebyshev’s inequality, we have P(X ^ |E[X]) ^ 45. Therefore, if we define 

Q ;= F^n{X > lE[X]}, 

it will suffice to prove that P(Q) ^ 2S. 

So suppose that to the contrary we have P(Q) > 26. Observe that 

t t t 

Y^nEW'Q) = > p«)TE|xi > i-Enu). 

j=l uj&Q j=l j=l 
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where the hrst inequality follows from the dehnition of Q. By the pigeonhole principle, it 
follows that P(-Ej n Q) > 5 ■ P(-Ej) for some j G [t]. But then 

which contradicts our assumption P(F | Ej^ '^1—6. □ 

In order to deduce Proposition 2.1 from Lemma 2.2, we need to bound the variance of the 
number of m-sets A with exactly k'^m edges of G{n,p), and show that such a set is likely to 
contain a graph G as in the statement of the proposition. We begin with the latter claim, 
which follows from a standard argument, originally due to Erdos [6]. 


Lemma 2.3. For all sufficiently large k the following holds. Let H be chosen uniformly from 
the set of graphs with m vertices and k^m edges. Then, with high probability as m ^ oo, 
there exists a spanning subgraph G F H with 


x(G) ^ k and girth(G) ^ k, 


and moreover, for every pair A,B F V{H) of disjoint vertex sets with |A|, \B\ ^ m/ik, the 
graph G[A, B] contains an edge. 


Proof. Given A and B vertex disjoint sets of size exactly m/Ak, since e{^F[[A, Bfj is a hyper- 
geometrically distributed random variable with expected value 

/ \ 

/ /JOO \ 

|zl| ■ \B\ ■ k'^m 


m 
^ 16’ 


we have (see for example [8, Theorem 2.10]) 

[e{H[A,B]) ^ ^ 


By the union bound, the probability that there exist A and B in B such that e{H[A, 5]) ^ 
m/32 is at most 

I g—m/200 ^ g—m/300 

where the inequality holds since k is sufficiently large. 

Now let Z denote the number of cycles in H of length at most k. Then 



E[Z] s; Y, 


m 


e=3 


(/) 


k'^m — i 


(?) 

k'^m 


-1 




E 

e=3 


m 


2k^ 


m 


^ {2k 


2\k+l 


By Markov’s inequality, it follows that with high probability we have Z ^ and in partic¬ 
ular with high probability H is such that the following holds. By removing one edge from 
each cycle of length at most k, we obtain a subgraph G with girth(G) ^ k and such that 
for every pair A,BO V{H) of disjoint vertex sets with |A|, \B\ ^ m/Ak, the graph G[A,B] 
contains an edge. Then G has no independent set of size m/k, so x(G) ^ k. □ 
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Lemma 2.4. Let 0 < e < 1 and k eN, and suppose that p = p{n) satisfies 1. 

Let X be the random variable that counts the number of sets A C [n] with \A\ = m = e/p 
such that e[G{n,p)[A]) = kfm. Then 


where C = 


Var(X) ^ 


Cvr? 

n 


E[X]^ 


Proof. Let s = k'^m. Observe that the expectation of X is 

(3) E|X1= 

We now calculate E[X^]. We need to bound the expected number of pairs (A, B) of sets of 
size m, each with exactly s = k'^m edges. Let i = |yl fl i?| denote the size of the intersection 
of these sets, and let t = e(A fl B) denote the number of edges contained in both sets. We 
have 





Note that ^ siiice 2m ^ n, and that ^ ^ ( 

Thus 



” V^f2!!!V7"y and 

\m — tj \m — I) \m — 1) V ^ / 



and hence 





(3} 



2t 

p~\l 


Substituting (f) ^ i^Y and (Wf ^ (^Y, and recalling that pi ^ pm = e, and thus 
(1 - p)-^ ^ e^P^ = 0(1), we have 



Since {x/ty is maximised when t = x/e, and therefore has maximum it follows that 


m / 2 \ ^ 

E[X2] ^ * 


e=o 


exp 


8sH^ 

m‘^p 


Now, since i ^ m, pm = e and s = k'^m, we obtain 


E[A-] < E|Al^f:e°<'>(^)'exp(^) « ElAJ'^f 


e=o 


^ ) 


£=0 


Cm 

ni 


2 \ ^ 


for some C = This last sum is bounded above by a geometric series with ratio 

Cm?/n, which, since m = e/p and by our choice of p, is smaller than The sum is 
therefore bounded above by 1 + ‘iCm?/n^ as desired. □ 


We can now easily deduce Proposition 2.1. 

Proof of Proposition 2.1. Given e > 0, we can assume k ^ 4/e is sufficiently large. Set 
m = e/p, dehne Ea to be the event that G{n,p)[A\ contains exactly k‘^\A\ edges, and let F 
denote the event that there exists a subgraph G C G{n,p) such that n(G) ^ e/p, x(G) ^ k 
and girth(G) ^ k, and moreover, for every pair A,B<PV{G) of disjoint vertex sets of size at 
least e|G|, the graph G[A, B] contains an edge. We are required to prove that P(F) = 1—o(l). 
But this follows from Lemma 2.2 since the conditions of this lemma are satished for some 
5 = o(l). Indeed, since p = o(l), we have Var(X) = o(E[X]^) by Lemma 2.4, where 

X = t)y Lemma 2.3 we have P(F | Ea) = 1 — o(l). □ 


3. Proof of Theorem 1.2 

To prove Theorem 1.2, we will use some known results. The hrst is the so-called sparse 
random Erdos-Stone theorem, which was originally conjectured by Kohayakawa, Luczak 
and Rodl [10], and proved by Conlon and Gowers [5] (for strictly balanced graphs H) and 
Schacht [16] (in general). 

Theorem 3.1 (Gonlon and Gowers, Schacht). For every graph H, every 7 > 0, and every 
p 3 > n~^C' 2 {H)^ following holds with high probability. For every H-free subgraph G C 

G{n,p), we have 

In particular, 6^{H,p) ^ 7 r(iL). 

The bound on p in Theorem 3.1 is sharp, as is shown by the following proposition. 

Proposition 3.2. For every graph H, every 7 > 0, and every p 

following holds with high probability. There exists an H-free spanning subgraph G C G{n,p) 

with 

S{G) ^ (1 — 2'y)pn and x(G) ^ ■ 

In particular, 5^{H,p) = 1. 
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For the proof we need the following lemma, which is a straightforward consequence of the 
celebrated ‘polynomial concentration inequality’ of Kim and Vu [9]^. Recall that a graph F 
is said to be 2-balanced if ^ subgraph F' C F with v{F') ^ 3. Given 

a graph F and a vertex v, let Xp{v) denote the random variable that counts the number of 
copies of F in G{n,p) that contain v. 

Lemma 3.3. Let F be a 2-balanced graph with m 2 {F) > 1, and suppose that p = p{n) 
satisfies ^ p ^ Then 

(4) v{xAv) 3^ E[AV(«)] + ^ 4 

\ log n J 

for all sufficiently large n. 

In order to prove Lemma 3.3, we will need to recall the main theorem of [9]. For each set 
S C E{Kn), let Ep{S,v) denote the conditional expectation, given that S C E[G{n,p)), 
of the number of copies of F in G{n,p) which contain v and use all the edges in S. Let 
Epiv) = max 5 ^ 0 n), and Ep{v) = max|i?^(n), E[Xir(n)]}. The main theorem of [9] 
implies^ that 

(5) p(Xir(t;) ^ E[XF(n)] + {Ep{v)E'p{v)y'^^ ^ 

if n is sufficiently large. 

Proof of Lemma 3.3. In order to apply (5) we need to bound Ep{S,v) from above for each 
non-empty set of edges S. We claim that, for each such S, we have 

(6) Ep{S,v) ^ pn(logn)“^®*'^^“"^. 

We now justify this claim. Given a non-empty set S of edges of iF„, let T be the set of 
vertices incident to the edges S. Observe that 

(7) Ep{S, v) ^ i;(F)TI^RO-|T|pe(F)-|S| _ 

Suppose first that |T| = v{F). Then Ep{S,v) ^ v{FY^^ = 0(1), from which ( 6 ) follows 
since p ^ ^ ^ Qj^ other hand, since F is 2-balanced, if 2 ^ |T| < v{F) then we 

have \S\ ^ m 2 {F)[\T\ — 2) -|-1. Since e{F) = m 2 {F)(v{F) — 2) -|-1 and m 2 {F) > 1, it follows 
that 

e(F) - 1^1 = m 2 {F){v{F) - |T| - l) + 1 + c 
for some c > 0, and hence, from (7), that 

Ep{S,v) ^ ^ ^ pn(logn)“^®*^^^“"^ 

if n is sufficiently large, by our choice of p. This proves (6). 

^An anonymous referee pointed out to us that one can avoid the polynomial concentration machinery by 
using a very nice trick of Krivelevich [12]. 

^To obtain (5), we apply the general theorem stated in [9] to the polynomial corresponding to the random 
variable Xf{v), with A = 2e(F) log ( 2 ). 



Now, to deduce (4), simply note that E[Xi7’(u)] ^ p^iP)n"(P) i ^ pn^ since p ^ n 
and F is 2-balanced. Thus, by (6), we have 

Ef{v)Ep{v) ^ 

and hence, by (5), we have 

as required. □ 

We can now easily deduce Proposition 3.2. 

Proof of Proposition 3.2. Suppose hrst that p ^ (logy^)^^ ^ ^^^gg let F C iP be a 

subgraph of H with = m 2 {H), and note that F is 2-balanced, and that m 2 {F) = 

7712(TT) > 1, since otherwise the statement is vacuous. Since p jj^ipHes that 

E[Xj7’(u)] = o{pn), it follows by Lemma 3.3 that, with high probability, every vertex of 
G{n,p) lies in at most ■ypn copies of F. 

On the other hand, if p ^ , then we may take F to be an arbitrary cycle in F[. 

(If iL is a forest then m 2 {H) ^ 1 and again the proposition holds vacuously.) To estimate 
the number of pairs of copies of F in G{n,p) both containing v, observe that if two such 
copies of F intersect in a set of edges S and vertices T, then T contains v and thus we have 
[S'! ^ |T| — 1. Furthermore, e(F) = v{F), and so the expected number of pairs of copies of 
F in G{n,p), both containing v, is at most 

^ Q^^2e{F)-\S\^2v{F)-\T\-l'^ ^ (logT^^ ^ 

SFE{F) ^ 

Hence, by Markov’s inequality, with high probability every vertex of G{n,p) lies in at most 
one copy of F. 

Now, by Chernoff’s inequality, G{n,p) has minimum degree at least (1 — 'y)pn, and each 
X C V(^G{n,p)) with |X| ^ yn contains more than p|Xp/4 edges. If all three of these likely 
events occur, then we can construct the desired graph G C G{n,p) simply by deleting one 
edge from each copy of F in G{n,p). 

To see that G has the required properties, observe hrst that G is F-free, and therefore 
F-free. Next, note that we have deleted at most ■ypn edges at each vertex of G{n,p), so we 
have S{G) ^ (1 — 27 )pn, as claimed. Finally, if |X| ^ then, since at most ypn^ edges 

of G{n,p) are deleted to obtain G, and X contains more than ypn^ edges of G{n,p), the set 
X is not independent in G. It follows that x(G) ^ as desired. □ 

For constant p, on the other hand, the chromatic threshold was determined in [1]. 

Theorem 3.4. For each constant p > 0 and graph H, we have S^{P[,p) = 

This together with the following lower bound on 5^{H,p) establishes Theorem 1.2. 

9 



Theorem 3.5. Let r ^ 4, s G N and 7 > 0. If p = pin) satisfies n ^ p 1 as 
n —)■ 00 , then with high probability G(n,p) contains a spanning subgraph G with 

xiG) ^ s and (5(G) ^ ^ —L^pn, 

such that every s-vertex subgraph of G is (r — 1)-colourable. 

In particular, 6^iH,p) ^ 7r(iJ) for every graph H with xiH) ^ 4. 

Before proving this theorem, we hrst spell out the deduction of Theorem 1.2. 

Proof of Theorem 1.2. If p > 0 is constant then the result follows from Theorem 3.4, and if 
p it follows from Proposition 3.2. Moreover, the upper bound in 

the range p 3> follows by Theorem 3.1. 

It remains to show that 5^(11,p) ^ 7 r(i?) for every 1 . If iP is bipartite 

then this is immediate, since niH) = 0, so let us assume that xiH) ^ 4 and mfiH) ^ 2. 

(Note that m 2 {II) > 2 for every graph H with xiH) ^ 5. Indeed, if m 2 {II) ^ 2 then 
e{F) ^ 2v{F) — 3 for every subgraph F F H with more than one vertex. In particular, this 
implies that every subgraph of H has a vertex of degree at most 3, and hence xiH) ^ 4.) 
But now ^ , and so the claimed bound follows from Theorem 3.5. □ 

Now we will use Proposition 2.1 to prove Theorem 3.5. 

Proof of Theorem 3.5. To construct G, we hrst partition the vertex set into sets X and Y, 
with |X| = n/(r — 1), and expose the edges of G(n,p) contained in X. By Proposition 2.1, 
with high probability, there exists a subgraph F on at most e/p vertices with girth and 
chromatic number both at least s + 1. We hx one such F. 

We next expose the edges of G(n,p) between ViF) and Y, and let lu = Niu) OY for each 
u e ViF). Set 

V, = {X\V{F))U U 4 

uGV{F) 

and let V 2 U ■ ■ ■ U K-i be an arbitrary equipartition of F \ U«ev(F) Note that we have 
not yet revealed any pair between any Vi and Vj with i j. We reveal them now. 

We are now ready to dehne G to be the spanning subgraph of G(n,p) with edge set 

EiF) U [uv : u G ViF), v G lu} U [uv G G'(G(n,p)) : u e Vi, v G Vj, i j}- 

We claim that, with high probability, G has the desired properties. Indeed, note hrst that 
xiG) ^ xiH) > s. Next we (implicitly) use several Chernoff bounds to show that G has 
sufficiently high minimum degree. Indeed, since p 3> and viF) ^ e/p, with high 

probability we have | U«Gy(F) ^ P^ ' '^(-^) ^ which implies |Id| ^ ^ — ^n. So with 
high probability every vertex u E Vi has at least (^ — 7 )p^^ neighbours in IJjyi every 

vertex u G F(F) has at least — x)pn neighbours in Y (because the edges between F(F) 
and Y were only revealed after hxing F), as required. 

Finally, we claim that x(G[IF]) ^ r — 1 for every s-vertex subset W C F(G). To show 

this, hrst colour W V Vi with colour i for each 1 ^ i ^ r — 1. Now, the remaining vertices 
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Figure 1. the gadget Go 

W r\V{F) induce a forest in G, because F has girth greater than s, and their neighbours 
in G are all in Vi, by construction. We can therefore complete a proper colouring of G[fF] 
using only colours 2 and 3 inside F. Since r ^ 4, this completes the proof. □ 

We hnish this section by noting that Theorem 1.2 does not cover all graphs of chromatic 
number four. 

Proposition 3.6. There exist infinitely many graphs F[ with = 4 and m 2 {H) < 2. 

Proof. We will construct a graph Go, which we call a gadget (see Figure 1), with 12 vertices 
and 19 edges, and which satishes the following key property: there are (non-adjacent) vertices 
x,y eV (Go) such that in any proper 3-colouring of Go the vertices x and y receive different 
colours. In other words, if we replace an edge xy of a larger graph G by a gadget on xy, 
then (when 3-colouring G) the gadget plays the same role as the edge. 

To dehne the gadget, set F(Go) = {x,y} U : 1 ^ i ^ 5} and 

E{Go) = {miM2,M2M3,M3M4,M4M5,M5Mi} U 

U [xui^xui.^xvi^xV'i] U {yu2,yUi,yv2,yVi] U {u^v^}. 

Now, let c: F(Go) —)■ {1,2,3} be a proper colouring, and suppose that c{x) = c{y) = 1. 
Then none of the vertices of {ui,Vi : 1 ^ i ^ 4} receives colour 1, and therefore it follows 
that 0(^5) = 0(^5) = 1, a contradiction. 

Now, given a graph H with x(hf) = 4, we can construct a graph F[' with 

X{H') = 4, v{H') = v{H) + 10 and e{H') = e{H) + 18, 

simply by replacing any edge of FT by a copy of the gadget. Moreover, if m 2 {H) < 2 then 
one easily checks that m 2 {H') < 2. Thus, in order to construct the claimed inhnite family 
of graphs, it suffices to construct a single example. 

In order to do so, consider the graph Fd obtained by replacing every edge xy of by a 
copy of the gadget. The resulting graph has 6 • 12 — 4 ■ 2 = 64 vertices and 6 ■ 19 = 114 edges. 
Moreover, it may be easily checked that m2 (FT) < 2, as required. □ 
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4. Lower bounds for Theorems 1.3 and 1.4 

By Theorem 3.4 we have that | and = 0 for any constant p > 0, 

and by Proposition 3.2 we have 6^{H,p) = 1 whenever p ^ 

section we will provide three constructions (see Propositions 4.2, 4.3 and 4.5) which imply 
the remaining lower bounds for 6^{Ks,p) and S^{C 5 ,p) in Theorems 1.3 and 1.4. However, 
since these constructions are also needed in [1], we shall, instead of giving them for H = 
and H = provide them in the following more general setting. 

Definition 4.1. A graph if is a cloud-forest graph if there is an independent set I C V{H) 
(the cloud) such that V{H) \ I induces a forest F, the only edges from J to F go to leaves 
or isolated vertices of F, and no two adjacent leaves in F send edges to /. 

Moreover, F is a thundercloud-forest graph if there is a cloud / C V{H), which witnesses 
that H is a cloud-forest graph, such that every odd cycle in H uses at least two vertices of J. 

Note that is not a cloud-forest graph, C5 is a cloud-forest but not a thundercloud-forest 
graph, and C 2 k+i is a thundercloud-forest graph for every k ^ 3. 

The following proposition implies 5y^{K^,p) ^ ^ for p{n) = o(l). 

Proposition 4.2. Let H be a graph with x(iP) = 3, and suppose that p{n) = o(l). 

If F[ is not a cloud-forest graph, then 6^{H,p) ^ |. 

Similarly, the next proposition implies S^{C 5 ,p) ^ | for p(n) = o(l). 

Proposition 4.3. Let H be a graph with x{H) = 3, and suppose that p{n) = o(l). 

If H is not a thundercloud-forest graph, then S^{H,p) ^ i. 

The constructions that prove these propositions are similar to (though somewhat more 
complicated than) that in the proof of Theorem 3.5 and rely, again, on Proposition 2.1. 

Proof of Proposition 4.2. We will show that, for every s G N and 7 > 0, with high probability 
G{n,p) contains a spanning subgraph G with 

X{G) ^ s and 5{G) ^ Q “ 

such that every s-vertex subgraph of G is a cloud-forest graph. This fact implies that 
S^{H,p) ^ 1/2 for every graph H that is not a cloud-forest graph. 

The construction of G is similar to that in the proof of Theorem 3.5, except that we will 
need to ensure that the neighbourhoods (in G) of the vertices of F are disjoint. To be precise, 
let us partition the vertex set into sets X and Y, with |X| = |T| = n/2, and expose the 
edges of G{n,p) contained in X. With high probability, we obtain (by Proposition 2.1) a 
subgraph F on at most e/p vertices with girth and chromatic number both at least s -t- 1. 
Next, expose the edges of G{n,p) between the vertices of F and Y, and for each u G V{F), 
dehne 

In = |nGT:iV(n)nH(F) = M}, 
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and note that the sets lu are pairwise disjoint. Observe also that |/„| is a binomial random 
variable with expected size |y| ■ p{l — ^ (l/2 — e)pn, and therefore with high 

probability \Iu\ ^ (1/2 — yjpn for every u G V{F). Now let G be the graph with edge set 

E{F) U [uv : u e V{F), v G /„} U [uv G E{G{n,p)) : u E Vi, v e V 2 }, 

where 

V, = {X\V{F))U IJ A and 1^2 := \ IJ 4- 

ugv{F) uev{F) 

Observe that x(G) ^ x{F) > s, and that S{G) ^ — 7)pn with high probability. 

It remains to show that for any set hh C (G) of at most s vertices, G\W] is a cloud-forest 
graph. To do so, we must hnd an independent set / C such that G\W \ J] is a forest, and 
the only edges from / to hh \ / go to non-adjacent leaves or isolated vertices of this forest. 
We claim that this holds for J = hh 0 V2, which is an independent set by the dehnition of G. 
Indeed, observe hrst that G\W \ /] is a forest, since W \ I F V{F) U Vi, the girth of F is 
greater than s, and since each vertex of Vi has at most one neighbour in V{F) U W- Now, 
every edge from I meets W \ I inVi, and (as just noted) every vertex oiW FVi is either an 
isolated vertex of G\W \ /], or a leaf. Since Vi is an independent set in G, all of these leaves 
are non-adjacent, and hence G\W] is a cloud-forest graph, as required. □ 

In order to prove Proposition 4.3, we will make use of the following construction of Luczak 
and Thomasse [13], see also [2, Theorem 14]. 


Lemma 4.4 (Luczak and Thomasse). For each s G N and 7 > 0, there exists a graph Fd 
with 



and a partition V{H) = AU B U G with the following properties: 

(а) \A\ ^ 7|iL| and x{H[A]) ^ s. 

(б) B and G are independent sets in H. 

(c) H[A,G] is empty and H[B,G] is complete. 

(d) Every odd cycle in Ff on s or fewer vertices uses at least two vertices of B. 


Proof of Proposition f.3. We will show that, for every s G N and 7 > 0, with high probability 
G{n,p) contains a spanning subgraph G with 


X{G) ^ s 


and 


S{G)^ 



pn 


such that every s-vertex subgraph of G is a thundercloud-forest graph. The existence of such 
a graph G implies that the chromatic threshold with respect to p of every graph that is not 
a thundercloud-forest graph is at least 1/3, as required. 

We choose e > 0 sufficiently small, and partition the vertex set into sets X and Y, with 
|X| = 2n/3 and |T| = n/3. As before, we reveal the edges within X and use Proposition 2.1 
to hnd a subgraph F with chromatic number and girth greater than s, on e/p vertices, such 

that any two disjoint subsets of V{F) of size at least e\V{F)\ have an edge joining them. 
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As before, we reveal the edges from V{F) to Y and for each u G V{F) let C y be the 
vertices of Y whose unique neighbour in V(F) is u. Now set, slightly differently than before, 

V,=X\V(F) and V 2 = Y\ \J 4. 

u£V{F) 

Let H be the graph whose existence is guaranteed by Lemma 4.4, and V {H) = AU B U C 
be the partition for which properties {a)-{d) hold. Let 

0: V{F){JVi ^ AUB 

be a function which maps v{F)/\A\ vertices of F to each element of A and |yi|/|i?| vertices 
of Vi to each element of B.^ 

We are now ready to dehne G. It is the spanning subgraph of G{n,p) with edge set 
jtin G E{F) : (j){u)4>{v) G E{H)^ U ^uv : u G V{F),v G 4| 

U jure G E{G{n,p)) : v E lu for some u G V{F), w G Vi, (l){u)(f){w) G E{H)^ 

U G E{G{n,p)) : u E Vi,v E • 

We claim that, with high probability, 

i(G) > Q - 37 

Indeed, similarly as before, by several applications of a Chernoff bound, with high probability, 
every vertex u E V{F) has at least |4| ^ (| — 'y^pn neighbours, every vertex of Vi has at 
least (I — 'y)pn neighbours in V2, and every vertex of V 2 has at least (| — 7)pn neighbours 
in Vi- Finally, if m G V{F) then, since S{E[) ^ — 'y)v{E[) and |A| ^ 'y\H\ (and in E[ all 

neighbours of (j){u) are in AU B), we have 

\{w eVi : (j){u)(j){w) E E{H)}\ > Q-27^n, 

and hence every vertex v E lu has at least (|—37)pn neighbours w E Vi such that (l){u)(l){w) E 
E{H). 

We next claim that x(G) ^ x(G[y(F)]) ^ s. Indeed, suppose that we colour G[y(F)] 
with fewer than s colours, and consider the colouring of H[A] in which the vertex a E A 
receives a most common colour in 0“^(a). This is a colouring of H[A] with fewer than s 
colours, so by Lemma 4.4 there is a monochromatic edge aa' of H[A]. Let c be the colour 
of a and a', and observe that there exist sets Z C and Z' C of colour c, 

each of size at least n(F)/(s|A|) > ev{F). Thus, since F is the subgraph obtained from 
Proposition 2.1, the graph F[Z, Z'\ contains an edge, and since aa! E E{H), this edge is also 
present in G. It follows that our colouring of G is not proper, and so x(G[y(F)]) ^ s, as 
claimed. 

^It is easy to see that we can adjust slightly the sizes of the sets so that all of these are integers. Alter¬ 
natively, we may allow some elements to receive an extra vertex. 
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Finally, we claim that G[hF] is a thundercloud-forest graph for every set W C V(G) with 
|fF| = s. To prove this, we need to show that there exists a cloud I <Z W which witnesses 
that FT is a cloud-forest graph, and is such that every odd cycle in G\W] uses at least two 
vertices of I. We will show that this is the case for the set 


/ = IF n Id- 

Note hrst that I is an independent set in G[fF], since Vi is an independent set in G. Next, 
observe that G\W \ /] is a forest, since the girth of F is greater than s, and since each vertex 
of IF \ (/ U V{F)'j is either in V2 and hence isolated in G\W \ /], or is in some lu so has at 
most M as a neighbour in G\W \ /]. Moreover, in the latter case u is not adjacent to any 
vertex of /, so that neighbours of I are non-adjacent leaves of the forest G[W\I], and thus I 
is a cloud. 

Finally, we need to check that every odd cycle in G\W] uses at least two vertices of J, so let 
S be an odd cycle in G[fF]. Note hrst that S <^V (F) UIJ^ lu since F has girth greater than 
s and each vertex of IJ^ lu has only one neighbour in V{F), and that every cycle containing 
a vertex of V 2 uses at least two vertices of Vi. 

Thus the only remaining case we need to rule out is that S consists of one vertex a; G W, 
whose neighbours in S are the vertices y,z E [Ju^u, and a path P with an even number of 
vertices in F from u to v, where y ^ lu and z E h- Since xy and xz and P are all edges of 
G, it follows that (f){u)(f){x), (f){v)(l){x) and 0(F) are all edges of H, which gives us a circuit 
of odd length in H. This circuit contains an odd cycle, which must (by Lemma 4.4) use 
at least two vertices of F, a contradiction (since = Fi). This proves that G\W] is 

indeed a thundercloud-forest graph, as required. □ 


We end the section with a slightly different (and easier) construction, which implies the 
remaining lower bounds in Theorems 1.4 and 1.5. 

Proposition 4.5. Let k ^ 2, and suppose that ^ p{n) n 3)/(2A: 2 )_ 

d^{G2k+up) > i. 

Proof of Proposition f.5. Let u = u{n) be any function tending to inhnity sufficiently slowly 
as n —)■ cxD and assume that 


^ p = p{n) ^ 1 


Given s G N and 7 > 0, we construct, with high probability, a G 2 fc+i-free spanning subgraph 
G C G{n,p) with x(G) ^ s and 5{G) ^ — 2'y^pn as follows. 

Partition the vertices into sets X and Y with |X| = |F| = n/2, and expose first the edges 
of G{n,p) contained in a subset X' C X of size u/p. We claim that, with high probability, 
there exists a subgraph F of G{n,p) in X' with maximum degree at most 2u, girth at least 
3k, and chromatic number at least log logo;. Indeed, this follows simply by removing vertices 
of degree greater than 2u, and a vertex from each cycle of length at most 3k. To spell out 

the details, observe that the expected number of independent sets in X' of size \X'\/y/Xogoj 

15 



tends to zero as n —)■ cxd, the expected number of cycles in X' of length at most 3k is at most 
3ku^^, and the expected number of vertices of degree greater than 2u is (by the Chernoff 
bound) at most Applying Markov’s inequality with each of these estimates in turn, 

we see that with high probability there are no independent sets in X' of size \X'\/yjlogoj, 
the number of cycles of length at most 3k is less than 3/ca;^^, and the number of vertices of 
degree greater than 2uj is at most e~^f^\X'\. If each of these events occurs, then, since u 
grows sufficiently slowly, the number of vertices of X' we must remove in order to remove 
all vertices of degree greater than 2u and cycles of length at most 3k is less than |X'|/2 for 
all large n. Now any graph with at least |X'|/2 vertices and independence number less than 
\X'\/yjlogoj has chromatic number at least log log n, so we obtain the desired F. 

We now give the pairs of vertices lying between X and Y an order r, in which the pairs 
with one end in V{F) come hrst, but which is otherwise arbitrary. We obtain a spanning 
subgraph G of G{n,p) by the following process. We start with E{Gq) = E{F), and then for 
each 1 ^ i ^ l-^ll^l we dehne Gj as follows. Let xy be the Ah edge in r. If xy G E(G{n,p)), 
and Gi-i U {xy} does not contain a copy of G2k+i-, and deg^. .^yj{xy} {y,V{F)) ^ 2ca, then 
we set Gi = Gj_i U {xy}. Otherwise we set Gi = Gj_i. We let G = G|x||y|- 

We claim that G is the desired graph. Indeed, Gq certainly contains no copy of G2k+i, and 
has the desired chromatic number, so by construction the same is true of G. It remains to 
show that with high probability S{G) ^ — 2'y}pn. We will bound the degrees of vertices 

in Y, then in X \ V{F), and hnally in V{F). Observe first that, by Chernoff’s inequality 
(and since p ^ ), the following events holds with high probability, where N(x) is the 

neighbourhood in G{n,p) of the vertex x: 

(a) |X(a;) O FI G (| ± j)pn for each x E X, 

(b) |X(|/) n X| G (I ± j}pn for each y E Y, and 

(c) there are at most vertices y eY such that |X(|/) 0 V{F)\ ^ 2u. 

Let us assume that each of these likely events holds. 

We claim that, for each y eY, the number of edges xy of G{n,p), with x E X, which are 
not present in G is stochastically dominated by Bm{u/p,p) + Bin(8/ca;^(pn)^^“^,p). Indeed, 
the number of edges of G{n,p) between y and V{F) is dominated by Bin(a;/p,p), and at 
worst we remove all such edges. The number of paths leaving y of length 2k is at most 
8 /ca;^(pn)^^“^, since in constructing such a path we never have more than pn choices for 
the next vertex, and at some point we must see a sequence of a vertex in Y followed by 
two vertices in F, and we have at most 2u choices for each vertex of F. Not all of these 
paths need be in any given Gi_i, but since there are certainly not more, since the Gj form 

a nested sequence of graphs, and since the event that the Ah edge of r appears in G{n,p) 

is independent of Gj_i, the claimed stochastic domination follows. By Chernoff’s inequality 
and the union bound, and by our choice of p, it follows that, with high probability, for each 
y eY there are at most 

'ypn ^ 2u:\ogn + lQkuj^p{pnf‘^~‘^ 

edges of G{n,p) incident to y not present in G, as required. 
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The proof is almost the same for vertices in X \ V{F). Indeed, given x E X \ V{F), it 
follows exactly as above that the number of edges xy of G{n,p), with y eY, which are not 
present in G is stochastically dominated by Bin( 8 /ca;^(pn)^^“^,p). (Note that in this case 
edges are only removed if they complete a copy of G 2 k+i-) As before, it follows that, with 
high probability, at most 'ypn edges of G{n,p) incident to x are not included in G for each 
a; eX\V{F). 

Finally, for each x G V{F) we claim that the number of edges xy of G{n,p), with y eY, 
which are not present in G is stochastically dominated by Bin(e“‘^AOj^^pj +Bin((2a;pn)^,p). 
The first term here counts edges xy which are in G{n,p) but not added to G because y had 
too many neighbours in V{F). The set of vertices in Y whose Gj_i-neighbourhood in V{F) 
has size [ 2 caJ is increasing in i. Since the probability that a given vertex of Y has at least [ 2 a;J 
G(n,p)-neighbours in V{F) is by Chernoff’s inequality at most by Markov’s inequality 

with high probability the total number of such vertices is at most Now since the 

event that the ith edge xy of r appears in G{n,p) is independent of Gj_i, the number of 
edges xy not included in G because y has too many neighbours in V{F) is stochastically 
dominated by Bin(e“‘^AOj^^pj_ Similarly, assuming that xy is the ith edge of r, the number 
of vertices y E Y such that there is a path of length 2k from x io y in Gj-i is at most 
{2u)^{pn)^, since (by the dehnition of r) every edge of such a path must be incident to F. 
As before, this implies the claimed stochastic domination. By Chernoff’s inequality and the 
union bound, and by our choice of p, it follows that, with high probability, for each x E V{F) 
there are at most 

edges of G{n,p) incident to x not present in G, as required. □ 

5. Upper bounds for Theorems 1.3, 1.4, and 1.5 

In this section we will determine 5^{Kz,p) and 5^{C^,p) for almost all functions p = pin), 
and 5^{C2k+i)P) for a certain range of p. Indeed, for the pieces are all already in place. 

Proof of Theorem 1.3. If p > 0 is constant then the result follows from Theorem 3.4 and 
the fact that 5y^{K^) = 1/3. Suppose next that pffi) = o(l), and recall that, 

by Theorem 3.1, we have S^{H,p) ^ 1/2. On the other hand, recall that is not a 
cloud-forest graph, so, by Proposition 4.2, we have S^{H,p) = 1/2 in this range. Finally, if 
p then it follows from Proposition 3.2 that 6^{H,p) = 1, as required. □ 

Next, we turn to the case H = C 5 . Recall that C 5 is not a thundercloud-forest graph. We 
have therefore already proved the following bounds: 

= 0 if p > 0 is constant, by Theorem 3.4, and since 5^{G;f) = 0, 

if p 1 , by Proposition 4.3, 

= 1 if p by Theorem 3.1 and Proposition 4.5, 

= 1 if ]£^ p by Proposition 3.2. 
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It therefore only remains to prove that S^{C 5 ,p) ^ 1/3 for all p = o(l). We remark 

that, under the stronger assumption p = this result follows from a general result 

(for all cloud-forest graphs) proved in [ 1 ]; the proof of the following result is similar, but 
signihcantly simpler. 

Proposition 5.1. S^{C 5 ,p) ^ 1/3 for every function <^p = o(l). 

To prove Proposition 5.1 we will use the sparse minimum degree form of Szemeredi’s 
Regularity Lemma. First recall the following dehnitions. 

Definition 5.2 ((£:,p)-regular pairs and partitions, the reduced graph). Given a graph G 
and > 0, a pair of vertex sets (A, B) is said to be {e,p)-regular if 

e{GlA,B]) e(G|A',y]) ^ 

lAIIBI |A'||y| 

for every X C A and YOB with |X| ^ e\A\ and |F| ^ £\B\. 

A partition V(G) = VoUViU.. .UV/ is said to be (e,p)-regular if |Vo| ^ en, |Fi| = ... = |I4|, 
and at most ek"^ of the pairs (V), Vj) with 1 ^ i < j ^ k are not (e,p)-regular. 

The {e,d,p)-reduced graph of an (e,p)-regular partition is the graph R with vertex set 
V{R) = {1,..., /c} and edge set 

E{R) = : [Vi^Vj) is an (e,p)-regular pair and e(G[I/, V/]) ^ | |}. 

We will use the following form of the Regularity Lemma, see [4, Lemma 4.4] for a proof. 
Note that although the statement there only guarantees lower-regularity of pairs in the 
partition, the proof explicitly gives an (£,p)-regular partition. 

Szemeredi’s Regnlarity Lemma (sparse minimum degree form). Let S,d,e > 0, 
/co G N and p = p{n) ^ (logn)"‘/n. There exists ki = ki{5, e, ko) G N such that the following 
holds with high probability. If G O G{n,p) has minimum degree 5{G) ^ 5pn, then there is an 
{e,d,p)-regular partition of G into k parts, where ko ^ k ^ ki, whose {e,d,p)-reduced graph 
R has minimum degree at least {5 — d — e)k. 

In the proof of Proposition 5.1 we will use the following fact, which is an easy consequence 
of Chernoff’s inequality: if p 3> then with high probability there are (l-|-o(l))p|17||R| 

edges between U and V for every pair of sets {U, V) with \U\ = Q{pn) and \V\ = 

Proof of Proposition 5.1. Let 7 > 0, and let G be a Gs-free spanning subgraph of G{n,p) 
with 

^(G') ^ Q + 27 ^pn. 

Applying the sparse minimum degree form of Szemeredi’s regularity lemma to G, with ko = 
1 / 7 , d = 7/10 and e > 0 sufficiently small, we obtain a partition V{G) = Vq U W U ■ ■ ■ U I 4 , 
such that the (e, d, p)-reduced graph R satisfies 
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Given any vertex n, we define the robust second neighbourhood N^i^v) of n in G to be the set 
of vertices w in G such that v and w have at least dp^n common neighbours in G. We dehne 

(10) W := {v e ViG) : |iV*(n) G ^ (1 + d) |Gi|} 

for each i E [k], and set Xq := V (G) \ (XiU ■ ■ ■ UX^). We will show that X* is an independent 
set for each 1 ^ i ^ k, and that Xq = 0. This implies that x(G) is bounded (by a constant 
depending on 7 ) as required. 

Indeed, let us hrst fix 1 ^ i ^ k, and suppose that uv G E{G[Xi\). Note that, by dehnition 
of Xj, the intersection (m) nX^ (n) is non-empty, so let w G X^ (m) nX^ (n). It follows that 

min {|X(m) n X(t(;)|, |X(n) G X(t(;)|} ^ dp^n > 4, 

since p 3> and therefore there exist distinct vertices u' and v' such that 

u' E N(u) C) N(w) and v' E N(v) O N(w). 

Since uv is an edge of G, it follows that uu'wv'v is a copy of G 5 in G, which is a contradiction. 
To show that Xq is empty, we will use the following claim. 

Claim: For each u G Xq, there exists an (e, d, p)-regular pair (G, Vj) such that 

\N;{u) G GI ^ e\Vi\ and |X*(m) G Vj\ ^ e\Vj\. 

Proof of claim. The claim follows from some straightforward edge-counting, together with 
the minimum degree conditions. Indeed, recalling that d(G) ^ (1 -|- 27 )pn, let U C N{u) be 
an arbitrary subset of size |17| = pn/?), and observe that (with high probability) there are 
at least pP'r?fStt edges incident to U, since e{G\U]) = oij?n^) (by Chernoff’s inequality) and 
each vertex in U has at least (1 -|- 27 )pn neighbours. 

Now, if |X 2 (m) G Vil ^ e|Id|, then there are (with high probability) at most 

2p ■ \U\ ■ e\Vi\ + dp^n ■ iGil ^ 2dp\ ■ |G| 

edges between G and U, so almost all of the edges incident to U go to sets V) with IX 2 (m) G 
hi I ^ ^|hi|- Moreover, since u E Xq, there are (with high probability) at most 

p\U\ ■ Q 2d^ |Id| -h dp^n ■ |Vi| ^ Q + 2d^p^n|hi| 

edges from Vi to U for every i E [k]. It follows that there must be at least 

indices i E [k] such that |X|(m) G Id| ^ e\Vi\. Since S{R) ^ + 'j'jk, it follows that there is 

an edge of R between two such indices, as required. n 

Now, suppose (for a contradiction) that there exists a vertex u E Xq, and let (V), Vj) 
be the pair given by the claim. By the dehnition of (e, d, p)-regularity, there exist vertices 
V E X 2 ( m ) G Vi and w E X 2 {u) G Vj such that vw is an edge of G, and (by the dehnition 

of the robust second neighbourhood) there exist distinct vertices v' and w' in the common 
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neighbourhoods of u and v, and of u and w, respectively. But then uv'vww' forms a copy of 
C 5 in G, and so we have the desired contradiction. □ 

Finally, let us prove Theorem 1.5. It follows from Proposition 3.2 that 5 -^{C 2 k+i,p) = 
1 whenever p ^-( 2 k-i)/ 2 k^ from Theorem 3.1 and Proposition 4.5 that 

5 ^{C 2 k+iiP) = 1/2 for every function 7 :i-( 2 fc-i)/ 2 A: p ^-i 2 k- 3 )/( 2 k- 2 )_ Thus, to deduce the 
theorem it will suffice to prove the following proposition. 

Proposition 5.3. 5 y,{C 2 i+i,p) = 0 for every i ^ 3 and p = o(l). 

Proof. Let 7 > 0, and let G be a G 2 £+i-free spanning subgraph of G{n,p) with 

(5(G) ^ 2'ypn. 

Applying the sparse minimum degree form of Szemeredi’s regularity lemma to G, we obtain 
a partition V{G) = Vq U lA U ■ • • U I 4 , such that the reduced graph R satisfies 5{R) ^ 7 /c. 
Dehne 

(11) X, := \y e V{G) : |iV*(n) n ^ d|l/|} 

for each i G [k], where N^i^v) is as defined in the proof of Proposition 5.1. Set Xq : = 
V (G) \ (Xi U • ■ ■ UXfc). We will again show that Xj is an independent set for each 1 ^ i ^ k, 
and that Xq = 0 . 

This time it is relatively easy to see that Xq = 0, since if n G Xq then |X 2 (n)| A 2dn, 
which implies that there are (with high probability) at most 

2p ■ \U\ ■ 2dn + dp^n ■ n = {S'yd + d)p^n^ < 'y‘^p^n^ 

edges incident to a set G C N{v) with \U\ = 2'ypn, contradicting our assumption on the 
minimum degree of G. 

Therefore, let us suppose that uv G E{G) for some u,v E Xi, and observe that N^i^u) fl Vi 
and iV|(n)nVi both have size at least d\Vi\. Let Vj be such that (IA,IA) (e, (i,p)-regular 
pair in G (this exists because S{R) > 0). We claim that there exists a path of length 2 i — 4 
in G[Vi, Vj] from N^i^u) fl V) to N^i^v) fl V) which uses neither u nor v. 

To hnd the desired path, we hrst construct a sequence of sets (i?i,..., 821 - 3 ) as follows. 
Set Bi = X 2 (m), and dehne Bt (for each 2 ^ ^ 2^ — 3) to be the set of vertices of G with 

at least 2 t neighbours in Bt_i. Observe that 

|G 2 i+i n 1/1 ^ (1 - e)|Vi| and \B 2 tP\Vj\ ^ {I - e)\Vj\ 

for every 1 ^ t ^ ^ — 2 , since the hrst time this fails we obtain a contradiction to the 
dehnition of an (e, (i, p)-regular pair. Thus 

\B 2 t- 3 r\N;{v)\ ^ |iV 2 *(n)nl/|-£|l/| ^ {d-e)\V\ > 2i. 

We can now choose the vertices (tci,..., 1021 - 3 ) of the path greedily, one by one, with wt G Bt, 
avoiding all previously-chosen vertices and u and v. Finally we choose vertices u' and v' (not 
so far used) in the common neighbourhoods of u and Wi and v and 1021-3 respectively, to 
complete a { 2 i + l)-vertex cycle. But we assumed that the graph G is G 2 ^+i-free, so this 
contradiction completes the proof of the proposition. □ 
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6. Open problems 


A large number of interesting questions about remain wide open, and we hope 

that the work in this paper will inspire further research on the topic. In this section we will 
mention just a few of (what seem to us) the most natural open problems. We begin with the 
following (somewhat tentative) conjecture, which says that the hypothesis of Theorem 1.2 
cannot be weakened to x(hf) 7 ^ 3. 

Conjecture 6.1. There exists a graph H with x{H) = 4 and a function p = p{n) with 

^-llm 2 {H) ^ ^- 1/2 

such that S^{H,p) < 7 i{H). 

We remark that, as far as we know, this might even be true for every graph H with 
x{H) = 4 and m 2 {H) < 2 and every function p in this range. It would therefore also be 
interesting to give a counter-example to this stronger statement. 

In the case x(Tf) = 3, we have barely begun to understand the behaviour of S^{H,p). An 
important next step would be to resolve the problem for all odd cycles. 

Problem 6.2. Determine 5 ^{C 2 k+i,p) for k ^ 3 in the range 7 :i-( 2 fc- 3 )/( 2 fc- 2 ) ^ 

We suspect that S^{C 2 k+i,p) = 0 for a much wider range of p than those we considered in 
Theorem 1.5; in particular, it seems plausible that by considering robust neighbourhoods 
for 2 ^ ^ A; — 1 (rather than just second neighbourhoods) the proof of Proposition 5.3 could 

be modihed to prove this for all p 3> Nevertheless, a signihcant gap remains, 

and we are not sure what to expect in the range 7 ^-( 2 fc- 3 )/( 2 fc- 2 ) ^ j^-(fc- 2 )/(fc-i)_ 

For more general 3-chromatic graphs, it follows from Theorem 3.1 and Proposition 4.2 
that if H is not a cloud-forest graph and p = p{n) satishes 

(12) max{n-^/”^ 2 (R)^^-i/ 2 j p 

then 6 ^{H,p) = 1/2. For cloud-forest graphs that are not thundercloud-forest we can at 
present only determine S^{H,p) in a smaller range: it follows from Proposition 4.3 and [1, 
Proposition 4.1] that 5^{H,p) = 1/3 if p = o(l) and p = In terms of lower bounds, 

we have S^{H,p) ^ 1/3 in the range (12) by Proposition 4.3, and S^{H,p) = 1 when p 
^-i/m 2 (R) Proposition 3.2; if m 2 {H) > 2 then these ranges overlap. In this case it is 
possible that these lower bounds are correct, though we cannot prove it. When m 2 {H) ^ 2 
we do not know what to expect in the gap between the two ranges. 

Question 6.3. Let H he a cloud-forest graph, and suppose that p = p{n) satisfies (12). Is 
6 ^{H,p) = 1/3 whenever H is a not a thundercloud-forest graph? 

Recall that for thundercloud-forest graphs, we do not even know how to determine S^{H,p) 
in the range p = see [1, Conjecture 1.6]. 

Finally, we do not know how to determine the behaviour of 6 ^{H,p) around the threshold 

p = ^~1/^2(R) 
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Problem 6.4. Determine forp= cn 

The construction used to prove Proposition 3.2 (that is, randomly remove one edge from 
each copy of H in G{n, p)) can easily be extended to give a lower bound in the range { 71 (H ), 1) 
for all sufficiently small c > 0. It would be interesting to understand how the extremal graph 
transitions (as c increases) from a random-like graph into a Turan-like graph. 

References 

[ 1 ] P. Allen, J. Bottcher, S. Griffiths, Y. Kohayakawa, and R. Morris, Chromatic thresholds in dense random 
graphs, Submitted (arXiv:1508.03870). 

[2] _, The chromatic thresholds of graphs, Adv. Math. 235 (2013), 261-295. 

[3] J. Balogh, R. Morris, and W. Samotij, Independent sets in hypergraphs, J. Amer. Math. Soc. 28 (2015), 
no. 3, 669-709. 

[4] J. Bottcher, Y. Kohayakawa, and A. Taraz, Almost spanning subgraphs of random graphs after adver¬ 
sarial edge removal, Combin. Probab. Comput. 22 (2013), no. 5, 639-683. 

[5] D. Conlon and W. T. Gowers, Combinatorial theorems in sparse random sets. Submitted 
(arXiv:1011.4310). 

[6] P. Erdos, Graph theory and probability, Ganad. J. Math. 11 (1959), 34-38. 

[7] P. Erdos and M. Simonovits, On a valence problem in extremal graph theory, Discrete Math. 5 (1973), 
323-334. 

[8] S. Janson, T. Luczak, and A. Rucihski, Random graphs, Wiley-lnterscience, New York, 2000. 

[9] J. H. Kim and V. H. Vu, Concentration of multivariate polynomials and its applications, Gombinatorica 
20 (2000), no. 3, 417-434. 

[10] Y. Kohayakawa, T. Luczak, and V. Rodl, On K'^-free subgraphs of random graphs, Gombinatorica 17 
(1997), no. 2, 173-213. 

[11] Y. Kohayakawa, T. Luczak, and V. Rbdl, Arithmetic progressions of length three in subsets of a random 
set, Acta Arith. 75 (1996), no. 2, 133-163. 

[12] M. Krivelevich, Bounding Ramsey numbers through large deviation inequalities. Random Structures 
Algorithms 7 (1995), no. 2, 145-155. 

[13] T. Luczak and S. Thomasse, Coloring dense graphs via VC-dimension, Submitted (arXiv:1007.1670vl). 

[14] J. Lyle, On the chromatic number of H-free graphs of large minimum degree. Graphs and Combinatorics 
(2010), 1-14. 

[15] D. Saxton and A. Thomason, Hypergraph containers. Invent, math. (2015), 1-68. 

[16] M. Schacht, Extremal results for random discrete structures. Submitted. 

[17] C. Thomassen, On the chromatic number of triangle-free graphs of large minimum degree, Gombinatorica 
22 (2002), no. 4, 591-596. 

[18] _, On the chromatic number of pentagon-free graphs of large minimum degree, Gombinatorica 27 

(2007), no. 2, 241-243. 

Peter Allen, Julia Bottcher 

Department of Mathematics, London School of Economics, Houghton Street, London, 
WC2A 2AE, UK. 

E-mail address: p.d.allenlj.boettcher@lse.ac.uk 

Simon Griffiths 

Department of Statistics, University of Oxford, 1 South Parks Road, Oxford, 0X1 3TG, 
UK. 

E-mail address: griffith@stats.ox.ac.uk 


22 



Yoshiharu Kohayakawa 

Instituto de Matematica e Estati'stica, Universidade de Sao Paulo, Rua do Matao 1010, 
05508-090 Sao Paulo, Brazil. 

E-mail address: yoshi@ime.usp.br 

Robert Morris 

IMPA, Estrada Dona Castorina 110, Jardim Botanico, Rio de Janeiro, RJ, Brazil 
E-mail address: rob@impa.br 


23 



